Tibetan alphabet

Tibetan
Type	Abugida
Languages	Tibetan Dzongkha Ladakhi Sikkimese Balti
Time period	c. 650–present
Parent systems	Proto-Sinaitic ^[a] Phoenician ^[a] Aramaic ^[a] Brāhmī Gupta Siddhaṃ Tibetan
Child systems	Limbu Lepcha Phagspa
ISO 15924	`Tibt, 330`
Direction	Left-to-right
Unicode alias	Tibetan
Unicode range	U+0F00–U+0FFF
[a] The Semitic origin of the Brahmic scripts is not universally agreed upon.

The Tibetan alphabet is an abugida of Indic origin used to write the Tibetan language as well as Dzongkha, Sikkimese (Denzongkha), Ladakhi, and sometimes Balti. The printed form of the alphabet is called uchen script (Tibetan: དབུ་ཅན་, Wylie: dbu-can; "with a head"), while the hand-written cursive form used in everyday writing is called umê (Tibetan: དབུ་མེད་, Wylie: dbu-med; "headless"). The alphabet is very closely linked to a broad ethnic Tibetan identity. Besides Tibet, it has also been used for Tibetan languages in Bhutan, India, Nepal, and Pakistan.^[1] The Tibetan alphabet is ancestral to the Limbu alphabet, the Lepcha alphabet,^[2] and the multilingual 'Phags-pa script.^[2]

The Tibetan alphabet is romanized in a variety of ways.^[3] This article employs the Wylie transliteration system.

History

The creation of the Tibetan alphabet is attributed to Thonmi Sambhota in the mid-7th century. Tradition holds that Thonmi Sambhota, a minister of Songtsen Gampo (569–649), was sent to India to study the art of writing, and upon his return introduced the alphabet. The form of the letters is based on an Indic alphabet of that period.^[4]

Three orthographic standardizations were developed. The most important, an official orthography intended to facilitate the translation of Buddhist scriptures, emerged during the early 9th century. Standard orthography has not changed since then, while the spoken language has, for example by losing complex consonant clusters. As a result, in all modern Tibetan dialects, and in particular in the Standard Tibetan of Lhasa, spelling (which reflects 9th-century spoken Tibetan) diverges greatly from pronunciation. This divergence is the basis of an argument in favour of spelling reform, to write Tibetan "as it is pronounced", for example, writing "Kagyu" instead of "Bka'-rgyud". In contrast, the pronunciation of the Balti, Ladakhi and Burig languages adheres more closely to the archaic spelling.

Description

The Tibetan alphabet has 30 consonants, sometimes known as radicals, which are the basis of the script.^[2]

ཀ ka	ཁ kha	ག ga	ང nga
ཅ ca	ཆ cha	ཇ ja	ཉ nya
ཏ ta	ཐ tha	ད da	ན na
པ pa	ཕ pha	བ ba	མ ma
ཙ tsa	ཚ tsha	ཛ dza	ཝ wa (not originally part of the alphabet)^[5]
ཞ zha ^[6]	ཟ za	འ 'a ^[7]
ཡ ya	ར ra	ལ la
ཤ sha ^[6]	ས sa	ཧ ha ^[8]
ཨ a

As in other Indic scripts, each consonant letter carries an inherent /a/. A distinctive feature of the Tibetan script is that consonants can be written either as radicals or in other forms, such as superscripts and subscripts. The superscript position above a radical is reserved for the consonants r, l, and s, while the subscript position under a radical is for the consonants y, r, l, and w. For example, the radical "ka" appears in both "kra" and "rka": when the r falls between the consonant and the vowel, it is added as a subscript; when the r comes before the consonant, it is added as a superscript.^[2] The letter r changes form when it is written above most other consonants, as in རྐ rka; an exception is the cluster རྙ rnya. Similarly, the consonants w, r, and y change form when they are written beneath other consonants: ཀྭ kwa, ཀྲ kra, ཀྱ kya.
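In encoded text, the stacking described above can be sketched as follows. This is a minimal Python illustration, assuming the usual Unicode convention that the topmost letter of a stack uses its full-form code point and each letter beneath it uses a "subjoined" code point; the constant and function names are hypothetical and chosen only for this example.

```python
# Code points from the Tibetan block (U+0F00-U+0FFF).
KA = "\u0F40"            # TIBETAN LETTER KA (full form)
RA = "\u0F62"            # TIBETAN LETTER RA (full form)
SUBJOINED_KA = "\u0F90"  # ka written beneath another letter
SUBJOINED_WA = "\u0FAD"  # w as a subscript
SUBJOINED_YA = "\u0FB1"  # y as a subscript
SUBJOINED_RA = "\u0FB2"  # r as a subscript


def stack(top, *below):
    """Join a full-form letter with the subjoined letters written under it."""
    return top + "".join(below)


kra = stack(KA, SUBJOINED_RA)  # r after the radical: subscript
rka = stack(RA, SUBJOINED_KA)  # r before the radical: superscript
kya = stack(KA, SUBJOINED_YA)
kwa = stack(KA, SUBJOINED_WA)

for name, text in [("kra", kra), ("rka", rka), ("kya", kya), ("kwa", kwa)]:
    # Show each stack with the code points it is built from.
    print(name, text, " ".join(f"U+{ord(c):04X}" for c in text))
```

Note that in "rka" the superscript r is the first code point and the radical ka is the subjoined one; which glyph shape the r takes is left to the font.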

Besides being written as subscripts and superscripts, some consonants can also be placed in prescript, postscript, or post-postscript positions. The consonants g, d, b, m, and ’a ("’a chung") can occupy the prescript position to the left of a radical, while the postscript position after a radical can be held by the ten consonants g, ng, d, n, b, m, ’a, r, l, and s. The third position, the post-postscript position, is reserved for the consonants d and s.^[2]
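The slot rules above can be summarized as a small lookup table. The following Python sketch is illustrative only: it writes letters in Wylie (with the velar nasal as "ng"), its names are hypothetical, and it checks only whether each letter is permitted in its slot, not whether a particular combination actually occurs in Tibetan.

```python
# The 30 consonants in Wylie, without the inherent vowel.
RADICALS = set(
    "k kh g ng c ch j ny t th d n p ph b m ts tsh dz w zh z 'a y r l sh s h a".split()
)

# Letters permitted in each position around the radical.
ALLOWED = {
    "prescript": {"g", "d", "b", "m", "'a"},
    "superscript": {"r", "l", "s"},
    "subscript": {"y", "r", "l", "w"},
    "postscript": {"g", "ng", "d", "n", "b", "m", "'a", "r", "l", "s"},
    "post_postscript": {"d", "s"},
}


def slot_errors(radical, **slots):
    """Return a list of slot-rule violations (empty if none were found)."""
    errors = []
    if radical not in RADICALS:
        errors.append(f"{radical} is not a radical")
    for slot, letter in slots.items():
        if slot not in ALLOWED:
            errors.append(f"unknown slot: {slot}")
        elif letter not in ALLOWED[slot]:
            errors.append(f"{letter} cannot be a {slot}")
    # A post-postscript follows a postscript, so it cannot stand alone.
    if "post_postscript" in slots and "postscript" not in slots:
        errors.append("post_postscript requires a postscript")
    return errors


# "bsgrubs": prescript b, superscript s, radical g, subscript r,
# postscript b, post-postscript s (the vowel u is not checked here).
print(slot_errors("g", prescript="b", superscript="s", subscript="r",
                  postscript="b", post_postscript="s"))
print(slot_errors("k", prescript="k"))
```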

The vowels used in the alphabet are a, i, u, e, and o. While the vowel a is inherent in each consonant or radical, the other vowels are indicated by marks; thus ཀ ka, ཀི ki, ཀུ ku, ཀེ ke, ཀོ ko. The vowels i, e, and o are placed above consonants as diacritics, while the vowel u is placed underneath consonants.^[2] Old Tibetan also included a reversed form of the i mark (gigu 'verso'), the significance of which is uncertain. Written Tibetan does not distinguish long and short vowels, except in loanwords, especially those transcribed from Sanskrit.
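As a small illustration of the vowel marks, the Python sketch below attaches each sign to the consonant ka. The code points are taken from the Tibetan Unicode block; the table layout and function name are hypothetical.

```python
KA = "\u0F40"  # TIBETAN LETTER KA

# Vowel -> (combining sign, position relative to the consonant).
# The vowel a is inherent, so it has no sign.
VOWEL_SIGNS = {
    "a": ("", None),
    "i": ("\u0F72", "above"),
    "u": ("\u0F74", "below"),
    "e": ("\u0F7A", "above"),
    "o": ("\u0F7C", "above"),
}


def with_vowel(consonant, vowel):
    """Append the sign for `vowel` to a consonant (nothing for inherent a)."""
    sign, _position = VOWEL_SIGNS[vowel]
    return consonant + sign


for vowel in "aiueo":
    print(f"k{vowel}", with_vowel(KA, vowel))
```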

In the Tibetan script, the syllables are written from left to right.^[9] Syllables are separated by a tseg (་); since many Tibetan words are monosyllabic, this mark often functions almost as a space. Spaces are not used to divide words.
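Because the tseg delimits syllables, a first approximation of syllable segmentation is a simple split on that character. The Python sketch below assumes the input uses only the ordinary tseg (U+0F0B) and ignores other punctuation; the function name is hypothetical.

```python
TSEG = "\u0F0B"  # TIBETAN MARK INTERSYLLABIC TSHEG


def split_syllables(text):
    """Split a run of Tibetan text into syllables at each tseg."""
    return [syllable for syllable in text.split(TSEG) if syllable]


# dbu-can ("uchen"), written as da + ba + u, tseg, ca + na, tseg.
UCHEN = "\u0F51\u0F56\u0F74" + TSEG + "\u0F45\u0F53" + TSEG

for syllable in split_syllables(UCHEN):
    print(syllable, len(syllable), "code points")
```

Splitting gives syllables, not words: since spaces are not used, finding word boundaries needs a dictionary or other linguistic knowledge beyond this sketch.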

Although some Tibetan dialects are tonal, the language had no tone at the time of the script's invention, and there are no dedicated glyphs for tone. However, since tones developed from segmental features they can usually be correctly predicted by the archaic spelling of Tibetan words.

As in other Indic scripts, clustered consonants are often stacked vertically. Some fonts and applications do not support this behavior for Tibetan, so the examples here may not display properly without a font such as Tibetan Machine Uni.

Transliteration of Sanskrit

Vowels

Devanagari	IAST	Tibetan	Dependent vowel signs	Devanagari	IAST	Tibetan	Dependent vowel signs
अ	a	ཨ		औ	au	ཨཽ	ཽ
आ	ā	ཨཱ	ཱ	ऋ	ṛ	རྀ	ྲྀ
इ	i	ཨི	ི	ॠ	ṝ	རཱྀ	ཷ
ई	ī	ཨཱི	ཱི	ऌ	ḷ	ལྀ	ླྀ
उ	u	ཨུ	ུ	ॡ	ḹ	ལཱྀ	ཹ
ऊ	ū	ཨཱུ	ཱུ	अं	aṃ	ཨཾ	ཾ
ए	e	ཨེ	ེ	अँ	am̐	ཨྃ	ྃ
ऐ	ai	ཨཻ	ཻ	अः	aḥ	ཨཿ	ཿ
ओ	o	ཨོ	ོ

Consonants

Devanagari	IAST	Tibetan	Devanagari	IAST	Tibetan
क	ka	ཀ	द	da	ད
ख	kha	ཁ	ध	dha	དྷ
ग	ga	ག	न	na	ན
घ	gha	གྷ	प	pa	པ
ङ	ṅa	ང	फ	pha	ཕ
च	ca	ཙ	ब	ba	བ
छ	cha	ཚ	भ	bha	བྷ
ज	ja	ཛ	म	ma	མ
झ	jha	ཛྷ	य	ya	ཡ
ञ	ña	ཉ	र	ra	ར
ट	ṭa	ཊ	ल	la	ལ
ठ	ṭha	ཋ	व	va	ཝ
ड	ḍa	ཌ	श	śa	ཤ
ढ	ḍha	ཌྷ	ष	ṣa	ཥ
ण	ṇa	ཎ	स	sa	ས
त	ta	ཏ	ह	ha	ཧ
थ	tha	ཐ	क्ष	kṣa	ཀྵ

The Sanskrit "cerebral" (retroflex) consonants ट ठ ड ण ष (ṭa, ṭha, ḍa, ṇa, ṣa) are represented by reversing the letters ཏ ཐ ད ན ཤ (ta, tha, da, na, sha) to give ཊ ཋ ཌ ཎ ཥ (Ta, Tha, Da, Na, Sa).
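In encoded text the reversed letters are separate code points rather than mirrored glyphs, so the correspondence can be expressed as a lookup table. The Python sketch below is illustrative; the mapping and function names are hypothetical, and the printed names come from Python's built-in Unicode database.

```python
import unicodedata

# Native letter -> reversed letter used for the Sanskrit retroflex series.
REVERSED = {
    "\u0F4F": "\u0F4A",  # ta  -> reversed ta  (Sanskrit ṭa)
    "\u0F50": "\u0F4B",  # tha -> reversed tha (Sanskrit ṭha)
    "\u0F51": "\u0F4C",  # da  -> reversed da  (Sanskrit ḍa)
    "\u0F53": "\u0F4E",  # na  -> reversed na  (Sanskrit ṇa)
    "\u0F64": "\u0F65",  # sha -> reversed sha (Sanskrit ṣa)
}


def to_retroflex(letter):
    """Return the reversed counterpart of a letter, or the letter unchanged."""
    return REVERSED.get(letter, letter)


for plain, reversed_letter in REVERSED.items():
    print(unicodedata.name(plain), "->", unicodedata.name(reversed_letter))
```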

The classical rule is to transliterate च छ ज झ (ca cha ja jha) as ཙ ཚ ཛ ཛྷ (tsa tsha dza dzha), respectively. Nowadays, ཅ ཆ ཇ ཇྷ (ca cha ja jha) can also be used.
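The two conventions for the Sanskrit palatals can be compared side by side. In this Python sketch the table and function names are hypothetical, and the aspirated jha is written as a base letter followed by subjoined ha (U+0FB7), which is an encoding choice made for this example.

```python
SUBJOINED_HA = "\u0FB7"

# IAST syllable -> Tibetan spelling under each convention.
PALATALS = {
    "ca":  {"classic": "\u0F59", "modern": "\u0F45"},  # tsa  / ca
    "cha": {"classic": "\u0F5A", "modern": "\u0F46"},  # tsha / cha
    "ja":  {"classic": "\u0F5B", "modern": "\u0F47"},  # dza  / ja
    "jha": {"classic": "\u0F5B" + SUBJOINED_HA,        # dzha
            "modern": "\u0F47" + SUBJOINED_HA},        # jha
}


def transliterate_palatal(iast, convention="classic"):
    """Look up the Tibetan spelling of a Sanskrit palatal syllable."""
    return PALATALS[iast][convention]


for iast in PALATALS:
    print(iast, transliterate_palatal(iast), transliterate_palatal(iast, "modern"))
```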

Unicode

Tibetan was one of the scripts in the first version of the Unicode Standard in 1991, in the Unicode block U+1000–U+104F. However, in 1993, in version 1.1, it was removed (the code points it occupied would later be used for the Burmese script in version 3.0). The Tibetan script was re-added in July 1996 with the release of version 2.0.

The Unicode block for Tibetan is U+0F00–U+0FFF. It includes letters, digits, and various punctuation marks and special symbols used in religious texts. Empty cells in the chart indicate unassigned code points:
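Block membership and assignment status can be checked programmatically. The Python sketch below uses the standard `unicodedata` module; the helper names are hypothetical, and whether a given code point counts as assigned depends on the Unicode version bundled with the Python interpreter.

```python
import unicodedata

TIBETAN_FIRST, TIBETAN_LAST = 0x0F00, 0x0FFF


def in_tibetan_block(ch):
    """True if the character's code point lies in U+0F00-U+0FFF."""
    return TIBETAN_FIRST <= ord(ch) <= TIBETAN_LAST


def describe(ch):
    """Return the character name, or a note if it is unassigned or outside the block."""
    if not in_tibetan_block(ch):
        return "outside the Tibetan block"
    # unicodedata.name() returns the default when no name is defined.
    return unicodedata.name(ch, "unassigned")


for code_point in (0x0F40, 0x0F48, 0x0F0B, 0x1000):
    print(f"U+{code_point:04X}", describe(chr(code_point)))
```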

Tibetan^[1] Unicode.org chart (PDF)
	0	1	2	3	4	5	6	7	8	9	A	B	C	D	E	F
U+0F0x	ༀ	༁	༂	༃	༄	༅	༆	༇	༈	༉	༊	་	༌	།	༎	༏
U+0F1x	༐	༑	༒	༓	༔	༕	༖	༗	༘	༙	༚	༛	༜	༝	༞	༟
U+0F2x	༠	༡	༢	༣	༤	༥	༦	༧	༨	༩	༪	༫	༬	༭	༮	༯
U+0F3x	༰	༱	༲	༳	༴	༵	༶	༷	༸	༹	༺	༻	༼	༽	༾	༿
U+0F4x	ཀ	ཁ	ག	གྷ	ང	ཅ	ཆ	ཇ		ཉ	ཊ	ཋ	ཌ	ཌྷ	ཎ	ཏ
U+0F5x	ཐ	ད	དྷ	ན	པ	ཕ	བ	བྷ	མ	ཙ	ཚ	ཛ	ཛྷ	ཝ	ཞ	ཟ
U+0F6x	འ	ཡ	ར	ལ	ཤ	ཥ	ས	ཧ	ཨ	ཀྵ	ཪ	ཫ	ཬ
U+0F7x		ཱ	ི	ཱི	ུ	ཱུ	ྲྀ	ཷ	ླྀ	ཹ	ེ	ཻ	ོ	ཽ	ཾ	ཿ
U+0F8x	ྀ	ཱྀ	ྂ	ྃ	྄	྅	྆	྇	ྈ	ྉ	ྊ	ྋ	ྌ	ྍ	ྎ	ྏ
U+0F9x	ྐ	ྑ	ྒ	ྒྷ	ྔ	ྕ	ྖ	ྗ		ྙ	ྚ	ྛ	ྜ	ྜྷ	ྞ	ྟ
U+0FAx	ྠ	ྡ	ྡྷ	ྣ	ྤ	ྥ	ྦ	ྦྷ	ྨ	ྩ	ྪ	ྫ	ྫྷ	ྭ	ྮ	ྯ
U+0FBx	ྰ	ྱ	ྲ	ླ	ྴ	ྵ	ྶ	ྷ	ྸ	ྐྵ	ྺ	ྻ	ྼ		྾	྿
U+0FCx	࿀	࿁	࿂	࿃	࿄	࿅	࿆	࿇	࿈	࿉	࿊	࿋	࿌		࿎	࿏
U+0FDx	࿐	࿑	࿒	࿓	࿔	࿕	࿖	࿗	࿘	࿙	࿚
U+0FEx
U+0FFx
Notes 1.^ As of Unicode version 6.0

Notes

1.^ Chamberlain 2008
2.^ Daniels, Peter T. and William Bright. The World’s Writing Systems. New York: Oxford University Press, 1996.
3.^ See for instance [1] [2]
4.^ Which specific Indic script inspired the Tibetan alphabet remains controversial. Recent study suggests Tibetan script was based on an adaption from Khotan of the Indian Brahmi and Gupta scripts taught to Thonmi Sambhota in Kashmir (Berzin, Alexander. A Survey of Tibetan History - Reading notes taken by Alexander Berzin from Tsepon, W. D. Shakabpa, Tibet: A Political History. New Haven, Yale University Press, 1967: http://www.berzinarchives.com/web/en/archives/e-books/unpublished_manuscripts/survey_tibetan_history/chapter_1.html).
5.^ Old Tibetan had no letter w, which was instead a digraph for 'w.
6.^ In the case of zh and sh the h signifies palatalization.
7.^ The h or apostrophe (’) usually signifies aspiration.
8.^ The single letter h represents a voiceless glottal fricative.
9.^ Asher, R. E. ed. The Encyclopedia of Language and Linguistics. Tarrytown, N. Y.: Pergamon Press, 1994. 10 vol.

References

Asher, R. E. ed. The Encyclopedia of Language and Linguistics. Tarrytown, NY: Pergamon Press, 1994. 10 vol.
Beyer, Stephan V. (1993). The Classical Tibetan Language. Reprinted by Delhi: Sri Satguru.
Chamberlain, Bradford Lynn. 2008. Script selection for Tibetan-related languages in multiscriptal environments. International Journal of the Sociology of Language 192:117–132.
Csoma de Kőrös, Alexander (1983). A Grammar of the Tibetan Language. Reprinted by Delhi: Sri Satguru.
Csoma de Kőrös, Alexander (1980–1982). Sanskrit-Tibetan-English Vocabulary. 2 vols. Reprinted by Delhi: Sri Satguru.
Daniels, Peter T. and William Bright. The World’s Writing Systems. New York: Oxford University Press, 1996.
Das, Sarat Chandra: “The sacred and ornamental characters of Tibet”. Journal of the Asiatic Society of Bengal, vol. 57 (1888), pp. 41–48 and 9 plates.
Das, Sarat Chandra (1996). An Introduction to the Grammar of the Tibetan Language. Reprinted by Delhi: Motilal Banarsidass.
Jäschke, Heinrich August (1989). Tibetan Grammar. Corrected by Sunil Gupta. Reprinted by Delhi: Sri Satguru.
